Strategy-Based Technology for Estimating MT Quality

Authors

  • Liugang Shang
  • Dongfeng Cai
  • Duo Ji
Abstract

This paper introduces our SAU-KERC system, which achieved an F1 score of 0.39 in the word-level quality estimation task at WMT2015. The goal is to assign each translated word an “OK” or “BAD” label indicating translation quality. We adopt a sequence labeling model, conditional random fields (CRF), to predict the labels. Since “BAD” labels are rare in the training and development sets, the recognition rate of “BAD” is low. To solve this problem, we propose two strategies. One is to replace the “OK” label with sub-labels to balance the label distribution. The other is to reconstruct the training set to include more “BAD” words.
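
The two strategies can be pictured as simple transformations of the labeled data before CRF training. The abstract does not say how the sub-labels are defined or how the training set is reconstructed, so the following is only a minimal sketch under assumptions: the sub-labels `OK_NEAR` and `OK_FAR` (an “OK” word adjacent or not adjacent to a “BAD” word), the function names, and the `min_bad_ratio` threshold are all hypothetical and are not taken from the paper.

```python
def split_ok_labels(labels):
    """Strategy 1 (assumed variant): replace "OK" with sub-labels.

    An "OK" word next to a "BAD" word becomes "OK_NEAR", any other "OK"
    becomes "OK_FAR". The paper's actual sub-label scheme is not given.
    """
    out = []
    for i, lab in enumerate(labels):
        if lab != "OK":
            out.append(lab)
            continue
        # Labels of the immediate left and right neighbours, if any.
        neighbours = labels[max(0, i - 1):i] + labels[i + 1:i + 2]
        out.append("OK_NEAR" if "BAD" in neighbours else "OK_FAR")
    return out


def merge_sub_labels(labels):
    """Map predicted sub-labels back to the original "OK"/"BAD" label set."""
    return ["OK" if lab.startswith("OK") else lab for lab in labels]


def select_bad_rich(sentences, min_bad_ratio=0.2):
    """Strategy 2 (assumed variant): rebuild the training set.

    Keeps only (words, labels) pairs whose share of "BAD" labels reaches
    the hypothetical threshold min_bad_ratio.
    """
    kept = []
    for words, labels in sentences:
        if labels and labels.count("BAD") / len(labels) >= min_bad_ratio:
            kept.append((words, labels))
    return kept


if __name__ == "__main__":
    gold = ["OK", "OK", "BAD", "OK", "OK"]
    print(split_ok_labels(gold))
    print(merge_sub_labels(split_ok_labels(gold)))
```

In such a setup a CRF would be trained on the sub-labeled sequences, and its predictions would be passed through `merge_sub_labels` before scoring against the “OK”/“BAD” gold labels.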

Publication date: 2015